Smart Paper Notes

Semantics-driven Attentive Few-shot Learning over Clean and Noisy Samples

Orhun Buğra Baran , Ramazan Gökberk Cinbiş

Category: Computer Vision

2022-01-09

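Over the last few years, few-shot learning (FSL) has attracted great attention as a way to minimize the dependency on labeled training examples. An inherent difficulty in FSL is handling the ambiguities that result from having too few training samples per class. To tackle this fundamental challenge, we aim to train meta-learner models that can leverage prior semantic knowledge about novel classes to guide the classifier synthesis process. In particular, we propose semantically conditioned feature attention and sample attention mechanisms that estimate the importance of representation dimensions and training instances. We also study the problem of sample noise in FSL, with a view to using meta-learners in more realistic and imperfect settings. Our experimental results demonstrate the effectiveness of the proposed semantic FSL model with and without sample noise.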
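
The two attention mechanisms named in the abstract can be illustrated with a minimal sketch. The abstract does not give the model's equations, so everything here is an assumption made for illustration: `semantic_query` and `feature_logits` are hypothetical quantities standing in for whatever the meta-learner derives from the class semantics, sample attention is taken as a softmax over dot-product scores, and feature attention as a per-dimension sigmoid gate.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def attentive_prototype(support, semantic_query, feature_logits):
    """Build one class prototype from a few support samples.

    support        : list of feature vectors for one class
    semantic_query : hypothetical vector derived from class semantics
    feature_logits : hypothetical per-dimension scores derived from semantics
    """
    # Sample attention: samples that agree with the semantics get more weight,
    # so a mislabeled (noisy) sample contributes less to the prototype.
    scores = [sum(a * b for a, b in zip(x, semantic_query)) for x in support]
    sample_w = softmax(scores)

    # Feature attention: a gate in (0, 1) for every representation dimension.
    feat_w = [sigmoid(z) for z in feature_logits]

    dim = len(support[0])
    proto = [
        feat_w[d] * sum(w * x[d] for w, x in zip(sample_w, support))
        for d in range(dim)
    ]
    return proto, sample_w, feat_w
```

With support samples `[[1.0, 0.0], [0.9, 0.1], [-1.0, 0.0]]`, where the last one plays the role of a noisy sample, and a query of `[2.0, 0.0]`, the noisy sample receives a very small attention weight and barely moves the prototype.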


Biomedical image analysis competitions: The state of current participation practice

Matthias Eisenmann , Annika Reinke , Vivienn Weru , Minu Dietlinde Tizabi , Fabian Isensee , Tim J. Adler , Patrick Godau , Veronika Cheplygina , Michal Kozubek , Sharib Ali

Category: Computer Vision | Machine Learning

2022-12-16

The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% of challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.


MaskSplit: Self-supervised Meta-learning for Few-shot Semantic Segmentation

Mustafa Sercan Amac , Ahmet Sencan , Orhun Bugra Baran , Nazli Ikizler-Cinbis , Ramazan Gokberk Cinbis

Category: Computer Vision

2021-10-23

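Like other few-shot learning problems, few-shot segmentation aims to minimize the need for manual annotation, which is particularly costly in segmentation tasks. Even though the few-shot setting reduces this cost for novel test classes, the training data still needs to be annotated. To alleviate this need, we propose a self-supervised training approach for learning few-shot segmentation models. We first use unsupervised saliency estimation to obtain pseudo-masks on images. We then train a simple prototype-based model over different splits of the pseudo-masks and different augmentations of the images. Our extensive experiments show that the proposed approach achieves promising results, highlighting the potential of self-supervised training. To the best of our knowledge, this is the first work that addresses the unsupervised few-shot segmentation problem on natural images.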
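
The abstract only says that a "simple prototype-based model" is trained on pseudo-masks. One common way to build such a model is masked average pooling followed by nearest-prototype labeling, and the sketch below assumes that formulation; it is not taken from the paper. The function names and the nested-list feature layout (height x width x channels) are hypothetical.

```python
import math


def masked_average_prototype(features, mask):
    """Average the feature vectors of all pixels selected by a binary mask.

    features : H x W x C nested lists of floats
    mask     : H x W nested lists of 0/1 (for example, a saliency pseudo-mask)
    """
    channels = len(features[0][0])
    total = [0.0] * channels
    count = 0
    for feat_row, mask_row in zip(features, mask):
        for feat, selected in zip(feat_row, mask_row):
            if selected:
                count += 1
                for c in range(channels):
                    total[c] += feat[c]
    if count == 0:
        raise ValueError("mask selects no pixels")
    return [t / count for t in total]


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def segment(query_features, fg_proto, bg_proto):
    """Label each query pixel 1 if it is closer to the foreground prototype."""
    return [
        [1 if cosine(f, fg_proto) > cosine(f, bg_proto) else 0 for f in row]
        for row in query_features
    ]
```

In a self-supervised setup of the kind described, the support mask would come from the saliency estimate rather than from a human annotator; the background prototype here is simply pooled over the inverted mask.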


DSI2I: Dense Style for Unpaired Image-to-Image Translation

Baran Ozaydin , Tong Zhang , Sabine Susstrunk , Mathieu Salzmann

Category: Computer Vision

2022-12-26

Unpaired exemplar-based image-to-image (UEI2I) translation aims to translate a source image to a target image domain with the style of a target image exemplar, without ground-truth input-translation pairs. Existing UEI2I methods represent style using either a global, image-level feature vector, or one vector per object instance/class but requiring knowledge of the scene semantics. Here, by contrast, we propose to represent style as a dense feature map, allowing for a finer-grained transfer to the source image without requiring any external semantic information. We then rely on perceptual and adversarial losses to disentangle our dense style and content representations, and exploit unsupervised cross-domain semantic correspondences to warp the exemplar style to the source content. We demonstrate the effectiveness of our method on two datasets using standard metrics together with a new localized style metric measuring style similarity in a class-wise manner. Our results evidence that the translations produced by our approach are more diverse and closer to the exemplars than those of the state-of-the-art methods while nonetheless preserving the source content.


Context-Aware Ensemble Learning for Time Series

Arda Fazla , Mustafa Enes Aydin , Orhun Tamyigit , Suleyman Serdar Kozat

Category: Machine Learning

2022-11-30

We investigate ensemble methods for prediction in an online setting. Unlike all the literature in ensembling, for the first time, we introduce a new approach using a meta learner that effectively combines the base model predictions by using a superset of the features, namely the union of the base models' feature vectors, instead of the predictions themselves. Here, our model does not use the predictions of the base models as inputs to a machine learning algorithm, but chooses the best possible combination at each time step based on the state of the problem. We explore three different constraint spaces for the ensembling of the base learners that linearly combines the base predictions, which are convex combinations where the components of the ensembling vector are all nonnegative and sum up to 1; affine combinations where the weight vector components are required to sum up to 1; and the unconstrained combinations where the components are free to take any real value. The constraints are both theoretically analyzed under known statistics and integrated into the learning procedure of the meta learner as a part of the optimization in an automated manner. To show the practical efficiency of the proposed method, we employ a gradient-boosted decision tree and a multi-layer perceptron separately as the meta learners. Our framework is generic so that one can use other machine learning architectures as the ensembler as long as they allow for a custom differentiable loss for minimization. We demonstrate the learning behavior of our algorithm on synthetic data and the significant performance improvements over the conventional methods on various real-life datasets that are extensively used in well-known data competitions. Furthermore, we openly share the source code of the proposed method to facilitate further research and comparison.
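
The three constraint spaces can be made concrete with a small sketch. It only illustrates the constraint sets themselves; how the paper's meta learner actually produces and optimizes the weights is not shown in the abstract. The parametrizations below (softmax for the convex case, a shift onto the sum-to-one hyperplane for the affine case) are assumptions chosen because they map any real vector into the required set.

```python
import math


def convex_weights(raw):
    """Map any real vector to nonnegative weights that sum to 1 (softmax)."""
    m = max(raw)
    exps = [math.exp(r - m) for r in raw]
    total = sum(exps)
    return [e / total for e in exps]


def affine_weights(raw):
    """Shift a real vector so it sums to 1; entries may be negative."""
    shift = (1.0 - sum(raw)) / len(raw)
    return [r + shift for r in raw]


def unconstrained_weights(raw):
    """No constraint: the weights are free to take any real value."""
    return list(raw)


def combine(weights, base_predictions):
    """Linear ensemble: weighted sum of the base model predictions."""
    return sum(w * p for w, p in zip(weights, base_predictions))
```

One consequence worth noting: under the convex and affine constraints, if all base models predict the same value, the ensemble returns exactly that value, since the weights sum to 1. The unconstrained case gives no such guarantee.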


DSR: Towards Drone Image Super-Resolution

Xiaoyu Lin , Baran Ozaydin , Vidit Vidit , Majed El Helou , Sabine Süsstrunk

Category: Computer Vision

2022-08-25

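Despite remarkable progress in recent years, single-image super-resolution methods are developed with several limitations. Specifically, they are trained on fixed content domains with certain degradations (whether synthetic or real). The priors they learn are prone to overfitting the training configuration. Therefore, how well they generalize to novel domains, such as drone top-view data, and across altitudes is currently unknown. Nonetheless, pairing drones with proper image super-resolution is of great value. It would enable drones to fly higher and cover larger fields of view while maintaining high image quality. To answer these questions and pave the way towards drone image super-resolution, we explore this application with a particular focus on the single-image case. We propose a novel drone image dataset, with scenes captured at low and high resolutions and across a span of altitudes. Our results show that off-the-shelf state-of-the-art networks suffer a performance drop on this different domain. We additionally show that simple fine-tuning and incorporating altitude awareness into the network's architecture both improve the reconstruction performance.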


A two-step machine learning approach to statistical post-processing of weather forecasts for power generation

Ágnes Baran , Sándor Baran

Category: Machine Learning (Statistics) | Machine Learning

2022-07-15

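By the end of 2021, the renewable energy share of global electricity capacity reached 38.3%, and new installations were dominated by wind and solar energy, which showed global increases of 12.7% and 18.5%, respectively. However, both wind and photovoltaic energy sources are highly volatile, which makes planning difficult for grid operators, so accurate forecasts of the corresponding weather variables are essential for reliable electricity predictions. The most advanced approach in weather prediction is the ensemble method, which opens the door to probabilistic forecasting; however, ensemble forecasts are often underdispersive and subject to systematic bias. Hence, they require some form of statistical post-processing, where parametric models provide full predictive distributions of the weather variables at hand. We propose a general two-step machine learning-based approach to calibrating ensemble weather forecasts: in the first step, improved point forecasts are generated, which then, together with various ensemble statistics, serve as input features of a neural network that estimates the parameters of the predictive distribution. In two case studies based on 100 m wind speed and global horizontal irradiance forecasts of the operational ensemble prediction system of the Hungarian Meteorological Service, the predictive performance of this novel method is compared with the forecast skill of the raw ensemble and of state-of-the-art parametric approaches. Both case studies confirm that, at least up to 48 h, statistical post-processing substantially improves the predictive performance of the raw ensemble for all considered forecast horizons. The investigated variants of the proposed two-step method outperform their competitors in skill, and the suggested new approach is well applicable to different weather quantities and to a broad range of predictive distributions.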


The use of deep learning enables high diagnostic accuracy in detecting syndesmotic instability on weight-bearing CT scanning

Alireza Borjali , Soheil Ashkani-Esfahani , Rohan Bhimani , Daniel Guss , Orhun K. Muratoglu , Christopher W. DiGiovanni , Kartik Mangudi Varadarajan , Bart Lubberts

Category: Computer Vision | Machine Learning

2022-07-07

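Delayed diagnosis of syndesmotic instability can lead to significant morbidity and accelerated arthritic change in the ankle joint. Weight-bearing computed tomography (WBCT) has shown promising potential for early and reliable detection of isolated syndesmotic instability using 3D volumetric measurements. While these measurements have been reported to be highly accurate, they are also experience-dependent and time-consuming, and they require a particular 3D measurement software tool, which leads clinicians to still show more interest in conventional diagnostic methods. The purpose of this study was to increase accuracy, accelerate analysis time, and reduce inter-observer bias by automating the 3D volume assessment of syndesmosis anatomy using WBCT scans. We conducted a retrospective study using previously collected WBCT scans of patients with unilateral syndesmotic instability. 144 bilateral ankle WBCT scans were evaluated (48 unstable, 96 control). We developed three deep learning (DL) models for analyzing WBCT scans to recognize syndesmotic instability. These three models included two state-of-the-art models (Model 1: a 3D convolutional neural network [CNN]; Model 2: a CNN with long short-term memory [LSTM]) and a new model (Model 3: a differential CNN LSTM) that we introduce in this study. Model 1 failed to analyze the WBCT scans (F1 score = 0). Model 2 misclassified only two cases (F1 score = 0.80). Model 3 outperformed Model 2 and achieved nearly perfect performance, misclassifying only one case in the control group as unstable (F1 score = 0.91), while being faster than Model 2.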
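
For readers unfamiliar with the metric reported above, the F1 score is the harmonic mean of precision and recall, which reduces to 2·TP / (2·TP + FP + FN). The sketch below uses that standard definition. The abstract does not report the confusion counts or the size of the evaluation split, so the counts in the usage note are hypothetical and are not a reconstruction of the study's results.

```python
def f1_score(tp, fp, fn):
    """F1 from true positives, false positives and false negatives.

    Returns 0.0 when there are no positives at all, so that a model which
    never detects the positive class gets the worst score instead of an error.
    """
    denom = 2 * tp + fp + fn
    return 0.0 if denom == 0 else 2 * tp / denom
```

A model that never predicts the positive class has `tp = 0` and therefore an F1 score of 0 regardless of how many negatives it labels correctly. With hypothetical counts `tp=4, fp=1, fn=1`, two misclassified cases give a score of 0.8.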


ISLES 2022: A multi-center magnetic resonance imaging stroke lesion segmentation dataset

Moritz Roman Hernandez Petzsche , Ezequiel de la Rosa , Uta Hanning , Roland Wiest , Waldo Enrique Valenzuela Pinilla , Mauricio Reyes , Maria Ines Meyer , Sook-Lei Liew , Florian Kofler , Ivan Ezhov

Category: Computer Vision

2022-06-14

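Magnetic resonance imaging (MRI) is a central modality for stroke imaging. It is used upon patient admission to make treatment decisions, such as selecting patients for intravenous thrombolysis or endovascular therapy. MRI is later used during the hospital stay to predict outcome by visualizing infarct core size and location. Furthermore, it may be used to characterize stroke etiology, e.g. to differentiate between (cardio-)embolic and non-embolic stroke. Computer-based automated medical image processing is increasingly finding its way into clinical routine. Previous iterations of the Ischemic Stroke Lesion Segmentation (ISLES) challenge have helped to identify benchmark methods for acute and sub-acute ischemic stroke lesion segmentation. Here, we introduce an expert-annotated, multi-center MRI dataset for the segmentation of acute to subacute stroke lesions. The dataset comprises 400 multi-vendor MRI cases with high variability in stroke lesion size, quantity, and location. It is split into a training dataset of n = 250 and a test dataset of n = 150. All training data will be made publicly available. The test dataset will be used for model validation only and will not be released to the public. This dataset serves as the foundation of the ISLES 2022 challenge, whose goal is to enable the development and benchmarking of robust and accurate segmentation algorithms for ischemic stroke.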


End-to-End Learning of Multi-category 3D Pose and Shape Estimation

Yigit Baran Can , Alexander Liniger , Danda Pani Paudel , Luc Van Gool

Category: Computer Vision

2021-12-19

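In this paper, we study the representation of the shape and pose of objects using their keypoints. To this end, we propose an end-to-end method that simultaneously detects 2D keypoints from an image and lifts them to 3D. The proposed method learns both 2D detection and 3D lifting from 2D keypoint annotations only. In this regard, we propose a novel method that explicitly disentangles pose and 3D shape by means of augmentation-based cyclic self-supervision. In addition to being end-to-end in image-to-3D learning, our method also handles objects from multiple categories using a single neural network. We use a Transformer-based architecture to detect the keypoints as well as to summarize the visual context of the image. This visual context information is then used while lifting the keypoints to 3D, allowing context-based reasoning for better performance. While lifting, our method learns a small set of basis shapes and their sparse non-negative coefficients to represent the 3D shape in a canonical frame. Our method can handle occlusions as well as a wide variety of object classes. Our experiments on three benchmarks show that our method performs better than the state of the art. Our source code will be made publicly available.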
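
The basis-shape representation mentioned above can be sketched as follows. The abstract states only that a 3D shape in the canonical frame is represented by a small set of basis shapes with sparse non-negative coefficients. The use of a ReLU to enforce non-negativity, the rotation about the z-axis as a stand-in for pose, and all function names are assumptions made for illustration.

```python
import math


def relu(x):
    return x if x > 0.0 else 0.0


def compose_shape(basis_shapes, raw_coeffs):
    """Canonical 3D shape as a non-negative combination of basis shapes.

    basis_shapes : list of K shapes, each a list of (x, y, z) keypoints
    raw_coeffs   : K real numbers; negatives are clipped to zero, which also
                   makes the coefficient vector sparse
    """
    coeffs = [relu(c) for c in raw_coeffs]
    num_points = len(basis_shapes[0])
    shape = []
    for p in range(num_points):
        point = tuple(
            sum(c * basis[p][axis] for c, basis in zip(coeffs, basis_shapes))
            for axis in range(3)
        )
        shape.append(point)
    return shape


def rotate_z(shape, theta):
    """Apply a pose to the canonical shape: rotation by theta about the z-axis."""
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    return [(x * cos_t - y * sin_t, x * sin_t + y * cos_t, z) for x, y, z in shape]


def project_orthographic(shape):
    """Drop the depth coordinate to obtain 2D keypoints."""
    return [(x, y) for x, y, _ in shape]
```

Keeping the coefficients (shape) separate from the rotation (pose) mirrors the disentanglement the abstract describes: the same canonical shape can be projected under different poses and compared against 2D keypoint annotations.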
